Crowdsourcing platforms offer a practical solution to the problem of affordably annotating large datasets for training supervised classifiers. Unfortunately, poor worker performance frequently threatens to compromise annotation reliability, and requesting multiple labels for every instance can lead to large cost increases without guaranteeing good results. Minimizing the required training samples using an active learning selection procedure reduces the labeling requirement but can jeopardize classifier training by focusing on erroneous annotations. This paper presents an active learning approach in which worker performance, task difficulty, and annotation reliability are jointly estimated and used to compute the risk function guiding the sample selection procedure. We demonstrate that the proposed approach, which employs active learning with Bayesian networks, significantly improves training accuracy and correctly ranks the expertise of unknown labelers in the presence of annotation noise.